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Abstract. In this paper we address the problem of generating all ele¬ 
ments obtained by the saturation of an initial set by some operations. 

More precisely, we prove that we can generate the closure by polymor¬ 
phisms of a boolean relation with a polynomial delay. This implies for 
instance that we can compute with polynomial delay the closure of a 
family of sets by any set of “set operations” (e.g. by union, intersection, 
difference, symmetric difference...). To do so, we prove that for any set 
of operations J-, one can decide in polynomial time whether an elements 
belongs to the closure by of a family of sets. When the relation is over 
a domain larger than two elements, our generic enumeration method fails 
for some cases, since the associated decision problem is NP-hard and we 
provide an alternative algorithm. 

1 Introduction 

In enumeration complexity we are interested in listing a set of elements, which 
can be of exponential cardinality in the size of the input. The complexity of these 
problems is thus measured in term of the input size and output size. The enumer¬ 
ation algorithm with a complexity polynomial in both the input and output are 
called output polynomial or total polynomial time. Another, more precise notion 
of complexity, is the delay which measures the time between the production of 
two consecutive solutions. We are especially interested in problems solvable with 
a delay polynomial in the input size, which are considered as the tractable prob¬ 
lems in enumeration complexity. For instance, the maximal independent sets of 
a graph can be enumerated with polynomial delay [1]. 

If we allow the delay to grow during the algorithm, we obtain incremental 
delay algorithms: the first k solutions can be enumerated in a time polynomial 
in k and in the size of the input. Many problems which can be solved with 
an incremental delay have the following form: given a set of elements and a 
polynomial time function acting on tuples of elements, produce the closure of 
the set by the function. For instance, the best algorithm to generate all circuits 
of a matroid is in incremental delay because it uses some closure property of the 
circuits [5]. 

Polynomial delay algorithms are in incremental delay. In this article, we try 
to understand when saturation problems which are natural incremental delay 


problems can be in fact solved by a polynomial delay algorithm. To attack this 
question we need to restrict the saturation operation. In this article, an element 
will be a vector over some finite set and we ask the saturation operation to act 
coefficient-wise and in the same way on each coefficient. We prove that, when the 
vector is over the boolean domain, every possible saturation can be computed 
in polynomial delay. To do that we study a decision version of our problem, 
denoted by Closure jr: given a vector v and a set of vectors S decide whether 
V belongs to the closure of S by the operations of iF. We prove ClosurEjt g P 
for all set of operations J- over the boolean domain. 

When the domain is boolean, the problem can be reformulated in term of set 
systems or hypergraphs. It is equivalent to generating the smallest hypergraph 
which contains a given hypergraph and which is closed by some operation. We 
show how to efficiently compute the closure of an hypergraph by any family of 
set operations (any operation that is the composition of unions, intersections and 
complementations) on the hyperedges. This extends known methods such as the 
closure of a hypergraph by union, by union and intersection or the generation 
of the cycles of a graph by computing the closure of the fundamental cycles by 
symmetric difference. 

The closure computation is also related to constraint satisfaction problems 
(CSP). Indeed, the set of vectors can be seen as a relation R and the problem 
of generating its closure by some operation / is equivalent to the computation 
of the smallest relation R' containing R such that / is a polymorphism of R'. 
There are several works on enumeration in the context of CSP, which deal with 
enumerating solutions of a CSP in polynomial delay mm- The simplest such 
result [3] states that in the boolean case, there is a polynomial delay algorithm if 
and only if the constraint language is Horn, anti-Horn, bijunctive or affine. Our 
work is completely unrelated to these results, since we are not interested in the 
solutions of CSPs but only in generating the closure of relations. However, we 
use tools from CSPs such as the Post’s lattice [6], used by Schaefer in its seminal 
paper [7], and the Baker-Pixley theorem [5]. 

The main theorem of this article settles the complexity of a whole family 
of decision problems and implies, quite surprisingly, that the backtrack search 
is enough to obtain a polynomial delay algorithm to enumerate the closure of 
boolean vectors. For all these enumeration problems, compared to the naive 
saturation algorithm, our method has a better time complexity (even from a 
practical point of view) and a better space complexity (polynomial rather than 
exponential). 

Moreover these algorithms may serve as a good tool to design other enumera¬ 
tion algorithms. One only has to express an enumeration problem as the closure 
of some sufficiently small and easy to compute set of elements and then to apply 
the method described in this article. Similarly, given a potentially large set of 
objects, it might be convenient to represent this set in a compact way with only 
some ’’essential” elements. We would like that these core elements act as a basis 
in the sense that any other element can be found by composing one or several 
operations on those base elements. If one can define such a basis with respect to 



a set of operations, a natural question arises: How to compute efficiently all the 
elements from the basis ? The results of this paper answer this question when 
the considered objects are subsets of a ground set and when the operations are 
’’set operations”. 

Finally, besides the generic enumeration algorithm, we try to give for each 
closure rule an algorithm with the best possible complexity. In doing so, we 
illustrate several classical methods used to enumerate objects such as amortized 
backtrack search, reverse search. Gray code ... 


1.1 Our results 


In Sec. we define enumeration complexity, our problem and the backtrack 
search. In Sec. we use Post’s lattice, restricted through suitable reductions 
between clones, to determine the complexity of Closure jr for all set of binary 
operations T. It turns out that there are only a few types of closure operations: 


1. the monotone operations, in subsection |3.1| 

2. the addition over F 2 , in subsection |3. 2 


3. the set of all operations, or almost all, in subsection |3.3| 

4. an infinite hierarchy of increasingly hard to enumerate closures, related to 
the majority function, in subsection |3.4| 


5. the few limit cases of the previous hierarchy in subsection 3.5 


Finally, in Sec. we give polynomial delay algorithm for three classes of 
closure operation over any domain and prove that the method we use in the 
boolean case fails in one case. 


2 Preliminary 

2.1 Basic notations 

Given n G N, [n] denotes the set {1,..., n}. For a set D and a vector v G H", we 
denote by Vi the coordinate of v. Let i,j G [n], we denote by Vij the vector 
(vi,Vj). More generally, for a subset / = of [n] with ii < ... < we 

denote by vj the vector (uq,..., ). Let 5 be a set of vector we denote by Sj the 

set {vi I V G S}. The characteristic vector u of a subset E of [n], is the vector in 
{0,1}” such that Ui = 1 if and only \i i G X. 

2.2 Complexity 

In this section, we recall basic definitions about enumeration problems and their 
complexity, for further details and examples see [5]. 

Let S be some finite alphabet. An enumeration problem is a function A 
from E* to 'P{E*). That is to each input word, A associates a set of words. 
An algorithm which solves the enumeration problem A takes any input word w 
and produces the set A{w) word by word and without redundancies. We always 




require the sets A{w) to be finite. We may also ask A{w) to contain only words 
of polynomial size in the size of w and that one can test whether an element 
belongs to A{w) in polynomial time. If those two conditions hold, the problem 
is in the class EnumP which is the counterpart of NP for enumeration. Because 
of this relationship to NP, we often call solutions the elements we enumerate. 

The computational model is the random access machine model (RAM) with 
addition, subtraction and multiplication as its basic arithmetic operations. We 
have additional output registers, and when a special OUTPUT instruction is 
executed, the contents of the output registers is outputted and considered as an 
element of the outputted set. We choose the RAM model, because it is closer to 
real computers and because it allows to store all found solutions and to look for 
one in logarithmic time in the number of solutions. 

The delay is the time between the productions of two consecutive solutions. 
Usually we want to bound the delay of an algorithm for all pairs of consecu¬ 
tive solutions and for all inputs of the same size. If this delay is polynomial 
in the size of the input, then we say that the algorithm is in polynomial delay 
and the problem is in the class DelayP. If the delay is polynomial in the input 
and the number of already generated solutions, we say that the algorithm is in 
incremental delay and the problem is in the class IncP. By definition we have 
DelayP C IncP. Moreover (DelayP n EnumP) ^ (IncP n EnumP) modulo the 
exponential time hypothesis [10] . In practice problems in DelayP are much more 
tractable, often because they can be solved with a memory polynomial in the size 
of the input. Note that in a polynomial delay algorithm we allow a polynomial 
precomputation step, usually to set up data structures, which is not taken into 
account in the delay. This is why we can have a delay smaller than the size of 
the input. 

We now explain a very classical and natural enumeration method called the 
Backtrack Search (sometimes also called the flashlight method) used in many 
previous articles mm- We represent the solutions we want to enumerate as 
vectors of size n and coefficients in [d]. In practice solutions are often subsets 
of [n] which means that k = 1 and the vector is the characteristic vector of the 
subset. 

The enumeration algorithm is a depth first traversal of a tree whose nodes 
are partial solutions. The nodes of the tree will be all vectors v of size I, for all 
I < n, such that v = W[i] and w is a solution. The children of the node v will 
be the vectors of size / -I- 1, which restricted to [/] are equal to v. The leaves of 
this tree are the solutions of our problem, therefore a depth first traversal will 
visit all leaves and yield all solutions. We want an enumeration algorithm with a 
delay polynomial in n. Since a branch of the tree is of size n, we need to be able 
to find the children of a node in a time polynomial in n to obtain a polynomial 
delay. The delay also depends linearly on d, but in the rest of the paper d will be 
constant. Therefore the problem is reduced to the following decision problem: 
given V of size I is there w a solution such that v = icj;] ? This problem is called 
the extension problem associated to the enumeration problem. 




Proposition 1. Given an enumeration problem A, such that for all w, A(w) 
can be seen as vectors of size n and coefficients in [d], with n and d polynomially 
related to |w|. If the extension problem associated to A is in P, then A is in 
DelayP. 

More precisely, the delay is n times the complexity of solving the extension 
problem times d. We will see in the next part, that the complexity of solving the 
extension problem can be amortized over a whole branch of the tree, since we 
solve it many times, using well chosen data structures. 

There is a second enumeration method to design a polynomial delay algo¬ 
rithm, named the supergraph method |13j . The idea is to organize the solutions 
(and not the partial solutions) as a DAG instead of a tree, and to traverse this 
DAG. For that we should be able to visit all the successors of a node in polyno¬ 
mial time. To avoid to enumerate several times a node of the DAG, the reverse 
search method is often used [13] . It consists on defining a canonical parent com¬ 
putable in polynomial time for each node. This method may have a better delay 
than the backtrack search, because the traversal goes over solutions only. 

2.3 Closure of families by set operations 

In this subsection, we define the family of enumeration problems we want to 
solve. We fix D a finite domain. Given a t-ary operation / (a function from 
to D), / can be naturally extended to a t-ary operation over vectors of the same 
size. For a t-uples of vectors of size n / will then acts coefficient-wise, 

that is for all i < n, f{v^, ■ ■ ■, v*)i = f{vj ,..., u|). 

Definition 1. Let IF be a finite set of operations over D. LetS be a set of vectors 
of size n over D. Let = {f{vi,... ,Vt) | ui,...,Uj € and f G J-} 

and IF'^{S) = S. The closure of S by IF is Cljr{S) = 

Remark that Cljr{S) is also the smallest set which contains S and which is 
closed by the operations of IF. The set Cljr{S) is invariant under the operations 
of IF: these operations are called polymorphisms of the set Cljr(S), a notion 
which comes from universal algebra. 

As an illustration, assume that D = {0,1} and that IF = {V}. Then the ele¬ 
ments of S can be seen as subsets of [n] (each vector of size n is the characteristic 
vector of a subset of [n]) and Closure^v} (5) is the closure by union of all sets in 
5. Let 5 = {{1,2,4}, {2,3}, {1,3}} then Cl{y}{S) = {{1,2,4}, {1,2,3,4}, {2,3}, {1,3}, {1,2,3}}. 
Remark that Cl[yy{S) is indeed closed by union, that is V is a polymorphism of 
C'Z{v}(5). 

We now introduce the family of enumeration problems, parametrized by J- 
a set of operations over D, which we will try to solve in this article. 

EnumClosurejr 
Input; A set of vectors S 
Output: Cljr(^S) 


In all this article, we will denote the size of the vectors of 5 by n and the 
cardinal of S by m. We introduce two related decision problems. First, the exten¬ 
sion problem associated to a set of operations T, is the problem ExtClosurEjt: 
given S a set of vectors of size n, and a vector v of size I < n, is there a vec¬ 
tor v' e Cljr{S) such that V[/] = v'. Second, the closure problem, denoted by 
Closure jr, is a restricted version of the extension problem where v is of size n. 

Proposition 2. If Closure jr g P then EnumClosurejc g DelayP. 

Proof. ExtClosurEjt can be reduced to Closure jr- Indeed, given a vector 
V of size Z, because the operations of P act coordinate-wise, the two following 
predicates are equivalent: 

— 3v' g Cljr{S) such that V[i] = v' 

- V € Cljr(S[l]) 

Therefore if Closure jr S P then we have also ExtClosurejf € P. We can use 
Prop. [2 to conclude. □ 

We have introduced an infinite family of problems, whose complexity we 
want to determine. Several families of operations may always produce the same 
closure. To deal with that, we need to introduce the notion of functional clone. 

Definition 2. Let P be a finite set of operations over D, the functional clone 
generated by P , denoted by < P >, is the set of operations obtained by any 
composition of the operations of P and of the projections : D" —>■ D defined 
by 7r^(a;i,... ,a;„) = Xk- 

This notion is interesting, because two sets of functions which generate the 
same clone applied to the same set produce the same closure. 

Lemma 1. For all set of operations P and all set of vectors S, Cljr{S) = 

Cl<T>{S). 


The number of clones over D is infinite even when D is the boolean domain. 
However, in this case the clones form a countable lattice, called the Post’s lat¬ 
tice [6]. Moreover there is a finite number of well described clones plus a few 
very regular infinite family of clones. 


3 The Boolean Domain 

In this part we will prove our main theorem on the complexity of Closurejf, 
when the domain is boolean (of size 2). An instance of one such problem, denoted 
by S, will be indifferently seen as a set of vectors of size n or a set of subsets of 
[n]. 

Theorem 1. Let P be any fixed finite set of operations over the boolean domain, 
then Closurejf G P and EnumClosurejt g DelayP. 


There is also a uniform version of the decision problem, where F is given as 
input. It turns out that this problem is NP-hard as proven in Section [3^ 

To prove our main theorem, we will prove that Closure_f S P, for each clone 
T of the Post’s lattice. We first show that for some T the problem Closure jr 
can be reduced to Closure^ where Q is another clone obtained from T. This 
helps to reduce the number of cases we need to consider. 

To an operation / we can associate its dual / defined by /(si,...,st) = 

..., If is a set of operations, T is the set of duals of operation 
in T. We denote by 0 and 1 the constant functions which always return 0 and 

1. By a slight abuse of notation, we will also denote by 0 the all zero vector and 
by 1 the all one vector. 

Proposition 3. The following problems can be polynomially reduced to Closure jr 

1. Closure jru{o}, ClosureClosure jru{o.i} 

2. Closure^. _ 

3. Closure jru{^} when F = F 

Proof. The reductions follow easily from these observations: 

1. Cl^uiniS) = ClriS u {f}) for / = 0 or / = 1 and 5 ^ 0. 

2. Cl^{S) = Cljr{S) where S is the set of negation of vectors in S. 

3. (iS) = Cljr{S U S) since for every f £ F, there exists g £ F such 

that -■{f{vi,...,vt) = g{^vi,...,^vt). 

□ 

In the following picture, we represent the clones which cannot be reduced 
to another one using Prop. and that we will investigate in this article. For a 
a modern presentation of all boolean clones, their bases and the Post’s lattice 

see [13]. 

3.1 Conjunction 

We first study one of the simplest clone: E 2 =< A >. We give an elementary 
proof that Closures^ S P, then we explain how to obtain a good delay for 
EnumClosure^;^. For a binary vector v, let us denote by <S){v) (resp. l(n)) the 
set of indices i for which Vi = 0 (resp. Vi = 1). 

Proposition 4. Closure^;^ € P. 

Proof. Let 5 be a set of boolean vectors, if we apply A to a couple of vectors 
in S it produces the intersection of two vectors when seen as sets. Since the 
intersection operation is associative and commutative, CIe 2 {S) is the set of ar¬ 
bitrary intersections of elements of S. Let be a vector and let be the set 
{re € S I = 1 }. Assume now that v can be obtained as an intersection of 

elements Vi,... ,Vt, those elements must be in because of the monotonicity 
of the intersection for the inclusion. On the other hand, by definition of 5i, v 
will always be smaller or equal to Therefore, v G Cls^iS) if and only if 

V = This intersection can be computed in time 0{mn) which concludes 

the proof. □ 
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Fig. 1. The reduced Post’s lattice, the edges represent inclusions of clones 


By Prop. we can turn the algorithm for ClosurE £;2 into an enumeration 
algorithm for EnumClosure^^ with delay O(mn^). We show in the next propo¬ 
sition how to reduce this delay to 0{mn), which is the best known complexity 
for this problem. 

Proposition 5. There is an algorithm solving EnumClosure^;^ with a delay 
Olrnn). 

Proof. We use the backtrack search described in Prop, [l] but we maintain data 
structures which allows to decide Closure^;^ quickly. Let S be the input set 
of m vectors of size n. During the traversal of the tree we update the partial 
solution p, represented by an array of size n which stores whether pi = 1, pi = 0 
or is yet undefined. 

A vector u of 5 is compatible with the partial solution if Ip C Ij, . We maintain 
an array COMP indexed by the sets of S, which stores whether each vector of 
S is compatible or not with the current partial solution. Finally we update an 
array COUNT, such that COUNT[i] is the number of compatible vectors v G S 
such that Vi = 0. Remark that a partial solution p can be extended into a vector 



























of CIe 2 {S) if and only if for all i € Op COUNT[i\ > 0, the solution is then the 
intersection of all compatible vectors. 

At each step of the traversal, we select an index i such that pi is undefined 
and we set first = 0 then p^ = 1. When we set pi = 0, there is no change 
to do in COUNT and COMP and we can check whether this extended partial 
solution is correct by checking if COUNT\i] > 0 in constant time. When we set 
Pi = 1, we need to update COMP by removing from it every vector v such that 
Vi = 0. Each time we remove such a vector v, we decrement COUNT[j] for all j 
such that Vj = 0. If there is a j such that COUNT[j] is decremented to 0 then 
the extension of p by pi = 1 is not possible. 

When we traverse a whole branch of the tree of partial solutions during the 
backtrack search, we will set pi = 1 for each i at most once and then we need 
to remove each vector from COMP at most once. Therefore the total number 
of operations we do to maintain COMP and COUNT is 0{mn) and so is the 
delay. □ 

Remark that if we ask for all extensions of the sets instead of all intersections, 
we exactly get the problem of enumerating the solutions of a monotone DNF 
formula. In fact the algorithm used here is exactly the same as the best one to 
generate the solutions of a DNF formula. Moreover, we can reduce the problem 
of enumerating the solutions of a monotone DNF formula to EnumClosure£) 2 . 
The reduction we use is called parsimonious reduction and is relevant for count¬ 
ing and enumeration complexity (see [5]). It maps an instance of a problem to 
one of another problem in polynomial time and there is a bijection between the 
solutions associated to those instances which can be computed in polynomial 
time. For instance, since Cl-i^{S) = Cly{S) the problem of generating intersec¬ 
tions of hyperedges reduces to the problem of generating unions of hyperedges. 


Proposition 6. There is a parsimonious reduction from monotone DNF to 
EnumClosurE{v} . 

Proof. Let (j) = where the Ci are clauses over the variables cci,..., a;„. 

We build an hypergraph H over the domain [n]. For each clause Ci, let be the 
hyperedge {i \ Xi € Ci} and we also add the hyperedges U {xj} for all Xj ^ E. 
There is a bijection between the union of hyperedges and the solutions of the 
formula </>. □ 


Since the reduction is parsimonious, the problems of counting the elements 
of Cly and CId 2 are jJP-hard, while their enumerations are easy. Determining 
the exact complexity of CLOSUREy is an intriguing open problem: is it possible 
to design an algorithm with a complexity sublinear in m or even which depends 
on n only ? Even when the input hypergraph is a graph (every set in S is of 
size 2), m is bounded by and the question of solving CLOSUREy with a delay 
better than 0{n^) is open. 


3.2 Algebraic operations 


We first deal with the clone Lq =< + > where + is the boolean addition. 
Note that CIlo{S) is the vector space generated by the vectors in S. Seen as an 
operation on sets, it is the symmetric difference of the two sets. 

Proposition 7. Closure^^ G P. 

Proof. Let S be the set of input vectors, let u be a vector and let A be the 
matrix whose rows are the elements of S. The vector v is in ClLg{S) if and only 
if there is a solution to Ax = v. Solving a linear system over F 2 can be done in 
polynomial time which proves the proposition. □ 

The previous proposition yields a polynomial delay algorithm by applying 
Prop.j^ One can get a better delay, by computing in polynomial time a maximal 
free family M of S, which is a basis of CIlo{S). The basis M is a succinct 
representation of ClLg{S). One can generate all elements of ClLg{S) by going 
over all possible subsets of elements of M and summing them. The subsets can 
be enumerated in constant time by using Gray code enumeration (see |15j ). The 
sum can be done in time n by adding a single vector since two consecutive 
sets differ by a single element in the Gray code order. Therefore we have, after 
the polynomial time computation of M, an enumeration in delay 0(n). If one 
allows to output the elements represented in the basis M, the algorithm even 
has constant delay. 

With some care, we can extend this result to the clone L 2 generated by the 
sum modulo two of three elements. 

Proposition 8. CLOSUREi^ G P. 

Proof. First remark that any vector in ClL^iS) is the sum of an odd number of 
vectors in S. In other words v G CIl 2 {S) if and only if there is a a; such that 
Ax = V and that the Hamming weight of x is odd. One can compute a basis B 
of the vector space of the solutions to the equation Ax = u. If all elements of B 
have Hamming weight even, then their sums also have Hamming weight even. 
Therefore v G CIl^ {S) if and only if there is an element in B with odd Hamming 
weight, which can be decided in polynomial time. □ 

3.3 Conjunction and disjunction 

In this subsection, we deal with the largest possible clones of our reduced Post 
lattice: M 2 =< A, V >, i ?2 =< xl y : z > and Rq =< V, + >. 

Proposition 9. CLOSUREM 2 G P- 

Proof. Let 5 be a vector set and for all i S [n], let := {u S 5 | = 1}. We 

will show that a vector u belongs to ClM^iS) if and only if u := \J /\ v- 

i£lL{u) v^Xi 

Clearly, if M := V /\ u then u S C/ms (‘5). 

V^Xi 


Assume first that there exists i G l(u) such that Xi = 0 i.e. for all v G S, 
Vi = 0. Then clearly, for all w G CIm 2 {S), Wi = 0 and then u ^ Assume 

now that Xi ^ 0 for all i G IL(m) and assume that u t := \/ /\ v. So there 

iGl('u) vGXi 

exists j £ 0(m) such that tj = 1. Thus, there exists i G 1(m) such that for all 

V G Xi, Vj = 1. We have that for all v G S, Vi = 1 => vj = 1. Let us show 
that this property is preserved by both operations A and V and then that this 
property holds for all w G ClMyi^)- Assume that the property holds for a set 
J-. Let a,b G J- and let v '.= a /\h. li Vi = 1, we have = 1 and bi = 1 and 
then Qj = 1 and bj = 1. We conclude that Vj = aj A bj = 1. Assume now that 

V = ay b and that Vi = 1. Then either = 1 or = 1, say w.l.o.g. that = 1. 

Then aj = 1 and we have Vj = aj y bj = 1. We have shown that the property is 
preserved by both operations, therefore u cannot belong to ClMyiS) since Ui = \ 
and Uj =0. □ 

When we examine the previous proof, we see that the complexity of deciding 
Closurem 2 is O(mn^) therefore by applying Prop. we get an enumeration 
algorithm with delay We can precompute the n vectors 

and generate their unions in delay 0{n?) thanks to Prop.[^ By an hill climbing 
algorithm, using the inclusion structure of the x* we can obtain a 0{n) delay. 

Proposition 10. EnumClosurems can be solved with delay 0(n). 

Proof. Let S be the input. We first build the x* = A 'C- The inclusion is a 

vGSiVi — 1 

partial order between the Xi, we extend it into some total order T by topological 
sorting. We then generate all elements of Cl M 2 (S) by an Hill climbing algorithm: 
we go from one solution to another by adding a single xL Let v be the current 
solution, we maintain a list L of the indices i of v such that Vi = 0. At each step 
we select i the first element of L and we set Vj = 1 and remove j from L for all 
j G l(x*). This produces a new solution in time 0(n). We then recursively call 
the algorithm on this new solution and list. When the recursive call is finished, 
we call the algorithm on v and L \ {i}. 

This algorithm is correct, because the solutions generated in the two recursive 
calls are disjoint. Indeed, in the second call Vi will always be 0, because all 
indices in L are smaller than i in T. It means that x^ for j G L is either smaller 
or incomparable. Since x* is the smallest element with x* = I it implies that 
xj = 0 . □ 

If we consider EnumClosurEm 2 u{^}(‘ 5), it is very easy to enumerate. Let 
AT® = {u I u £ 5, Xi = 1} U {-'X I X £ 5,Xi = 0} and let x® = AveXi The 
set CIm 2 U{^}{S) is in fact a boolean algebra, whose atoms are the x®. Indeed, 
either x® ^ = x^j and they are equal or l 2 ,i n = 0. Let A = {x® | i G [n]}, two 
distinct unions of elements in A produce distinct elements. Hence by enumerating 
all possible subsets of A with a Gray code, we can generate CIm 2 u{^}(‘^) with 
a delay 0(n) (even 0(1) when always equal coefficients are grouped together). 

The closures by the clones i ?2 and i?o are equal to the closure by M 2 U{^} up 
to some coefficients which are fixed to 0 or 1, thus they are as easy to enumerate. 


Proposition 11. The problems Closure^Jj; Closure can be reduced to Closurems 
in polynomial time. 

Proof. Let 5 be a set of binary vectors. If for some i, for all v G S, Vi = 0 
(resp. 1) then for all w G CIr^{S), Wi = 0 (resp. 1). Therefore, we can assume 
that for all i, there is u and u in 5 such that Ui = 0 and Vi = 1. Remark 
that xl X : y = x\/ y thus by the previous assumption we can generate 1. Let 
w’- = . By assumption, we can chose such that u\ = 0. We 

set Wo = uq and by a trivial induction Wn = 0. Now remark that x ? 0 : 1 = x. 
Therefore we have Cl]i^{S) = Cl^y^^y and the problem Closure<v,^> can be 
polynomially reduced to Closurems by point 3 of Prop. 

If for some i, for all v G S, Vi = 0 then for all w G Clji^{S), Wi = 0. 
Therefore we can assume that for all i, there isu G S such that Ui = 1. Therefore, 

1 G Cliig{S) by doing the union of the elements it* such that u\ = 1. Finally, 

X + 1 = X therefore we also have ClRg{S) = C'Z<v,^>. □ 

3.4 Majority and threshold 

An operation / is a near unanimity of arity k if it satisfies /(xi, X 2 ,..., x^) = x 
for each fc-tuple with at most one element different from x. The threshold function 
of arity k, denoted by Th\_^^ is defined by Th\_^{xi ,..., Xk) is equal to 1 if and 
only if at least A: — I of the elements xi,... ,Xk are equal to one. It is the smallest 
near unanimity operation over the booleans. The threshold function Th^ is the 
majority operation over three booleans that we denote by maj and the clone it 
generates is D 2 . We first give a characterization of CId 2 {S) which helps prove 
that Closured^ G P. The characterization is a particular case of a universal 
algebra theorem that we then use to compute the closure by any clone which 
contains a threshold function. 

Lemma 2. Let S be a vector set, a vector v belongs to Clo^iS) if and only if 
for all i,j G [n], i j, there exists x G S such that Xij = Vij. 

Proof. (=>) Given a,b G {0,1} and i,j G [n], i ^ j, we first show that if for 
all V G S, Vi ^ a or Vj b then for all u G Clo^iS), Vi a or Vj ^ b. It is 
sufficient to prove that this property is preserved by applying maj to a vector 
set i.e. that if S has this property, then maj(S) has also this property. Let 
x,y,z G S, V := maj{x,y,z), and assume for contradiction that Vij = {a,b). 

Since Vi = a, there is at least two vectors among (x, y, z} that are equal to a 
at index i. Without loss of generality, let x and y be these two vectors. Since 
for all u G S, Ui ^ a or Uj b, we have Xj 7 ^ b and yj b and then vj 7 ^ b 
which contradicts the assumption. We conclude that if u G Clo^iS), then for all 
i,j G [n], there exists u G S with Vij = Uij. 

(<;=) Let k < n and let ai,...,afc G {0,1}. We will show by induction on k, 
that if for all i,j < k there exists v G S with Vi = Oi and Vj = aj, then there 
exists u G Clovis) with ui = oi, U 2 = 02 , ..., Uk = Ofc. The assertion is true 
for k = 2. Assume it is true for fc — 1, and let oi, ...,ak G {0,1}. By induction 


hypothesis there exists a vector w G CId 2 (S) with wi = oi, Wk-i = Ofe-i- 
By hypothesis, for alH < fc there exists G S with w- = and v], = a^- We 
then construct a sequence of vectors {u^)i<k as follow. We let v} = and for 
all 1 < i < fc, It* = maj{w,u^~^,v'‘). We claim that u := has the property 
sought i.e. for all i < k, Ui = Ui. First let prove that for all * < fc and for all 
j < *, M* = ttj. It is true for ui by definition. Assume now that the property 
holds for i < k. Then, by construction, for all j < i — 1, we have rt* = aj 

since wj = aj and = Uj. Furthermore, we have u* = maj{wi,ul~^,vl) = at 
since Wi = ai and vt = at. We conclude that for alH < fc — 1, ut = = a^. 

We claim now that for all i < k, u\ = ak- It is true for tt^. Assume it is true 
for i < k. Then we have u]. = maj{wk,u]7^,vl.) which is equal to ak since 
u]r^ = ak by induction and = ak by definition. We then have Ui = at for all 
i < k which concludes the proof. □ 

Corollary 1. Closured^ is polynomial. 

Proof. Using Lemma[^ one decides whether a vector v is in Clo^iS), by consid¬ 
ering every pair of index i,j and checking whether there is a vector w G S such 
that Vi,j = Wi,j. The complexity is in 0{inn^). □ 

By applying Prop[^ we get an enumeration algorithm in delay 0{mn^), and 
we explain how to improve this delay in the next proposition. 

Proposition 12. EnumClosured^ can he solved in delay O(n^). 

Proof. We do a backtrack search and we explain how to efficiently decide Closured^ 
during the enumeration. We first precompute for each pair (z, j) all values (a, 5) 
such that there exists v G S, Vij = (a, b). When we want to decide whether the 
vector V of size I can be extended into a solution, it is enough that it satisfies 
the condition of Lemma Moreover, we already know that f[z-i] satisfies the 
condition of Lemma[^ Hence we only have to check that the values of Vij for all 
i < I can be found in Siy which can be done in time 0{l). The delay is the sum 
of the complexity of deciding Closured^ for each partial solution in a branch: 
0(n2). □ 

It turns out that Lemma is a particular case of a general theorem of uni¬ 
versal algebra which applies to all near unanimity terms. However we felt it was 
interesting to give the lemma and its proof to get a sense of how the following 
theorem is proved. 

Theorem 2 (Baker-Pixley, adapted from [8j). Let P be a clone which con¬ 
tains a near unanimity term of arity k, then v G Cljr(^S) if and only if for all 
set of indices I of size k — 1, vj G Cljr{S)i. 

This allows to settle the case of Di =< maj,x y z > and of the two 
infinite families of clones of our restricted lattice Siq =< Th!^'^^,x f\{y\J z) > 
and Si 2 =< ,x f\{y ^ z) >. 


Corollary 2. If a clone J- contains then Closure is solvable in 0{mn^) 

In particular CLOSURE(iS'fQ), CLOSURE(S'f2) and Closure(Z?i) are in P. 

Proof. Let S bet a set of vectors and let f be a vector. By Th. e CiAS) 
if and only if for all J, vi € Cljr(S)j. First remark that Cljr{S)i = Cljr{Si) 
because the functions of P act coefficient-wise on S. The algorithm generates 
for each I of size k the set Cljr{Sj). For a given I, we first need to build the 
set Si in time m and then the generation of Cljr{Si) can be done in constant 
time. Indeed, we can apply the classical incremental algorithm to generate the 
elements in Cljr{Si), and the cardinal of Clj^iSi) only depends on k which is a 
constant. The time to generate all Cljr{Si) is 0{mn^) and then all the tests can 
be done in 0{n^). □ 

We have proved that the complexity of any closure problem in one of our 
infinite families is polynomial. Remark that we can use the method of Prop. 
to obtain a delay O(n^) for enumerating the elements of a set closed by a near 
unanimity function of arity k. Notice that we could have applied Theorem]^ to 
the clones of Subsection |3.3| which all contain the maj function. However, it was 
relevant to deal with them separately to obtain a different algorithm with delay 
0{n) rather than 0{n^). 

Notice that the complexity of Closurejt is increasing with the smallest arity 
of a near unanimity function in P. We should thus investigate the complexity of 
the uniform problem when the clone is given as input. We introduce the following 
restricted version which turns out to be hard. 

Closure Treshold 

Input: A set S of vectors and an integer k 
Output: Yes, if the vector 1 S Clgk^{S) 


Theorem 3. ClosureTreshold is coNP-complete. 

Proof. First notice that the problem is in coNP since by Theorem the answer 
to the problem is negative if and only if one can exhibit a subset of indices of I 
of size k such that no elements of Si is equal to 1. 

Let us show that the Hitting Set problem can be reduced to ClosureTreshold. 
Given a hypergraph P = (V,S), the Hitting set problem asks whether there 
exists a subset A C F of size k that intersects all the hyperedges of H. This 
problem is a classical NP-complete problem m- Let P = {V,£) he a hypergraph 
and k be an integer. Let P be the hypergraph on V whose hyperedges are the 
complementary of the hyperedges of P, and let S be the set of characteristic 
vectors of the hyperedges of P. Then P has a transversal of size k if and only 
if there is a set I of indices of size k such for all u S 5/, u 7 ^ 1. Indeed, / is a 
hitting set of P if for all A € £", there exists i € I such that i G E which implies 
that i E and then the characteristic vector u of i? is such that Vi = Q. 

Let us show that a set / of indices of size k is such that no element in Si is 
equal to 1 if and only if no element of Clgk (Si) is equal to 1. We assume that 


fc > 3 hence Siq =< >. Remark that if no element in Sj is equal to 1, 

then the application of to Si preserves this property. Indeed, let consider 

{v^,..., each u® has a zero coefficient and since there are fc +1 such 

vectors and the vectors are of size fc, by the pigeonhole principle, there are i,j, I 
such that vl = vf = 0. This implies that ..., 1. 

Since the other direction is straightforward, we have thus proved that there 
is a set / of indices of size fc such that for all u G 5/, u ^ 1 if and only if 
there is a set I of indices of size fc such that for all v G Clgk^{Si), u ^ 1. By 
Theorem [ 2 I the later property is equivalent to 1 ^ Clgk (S). Therefore we have 
given a pmynomial time reduction from Hitting set to the complementary of 
ClosureTreshold which proves the proposition. □ 

In fact, the result is even stronger. We cannot hope to get an FPT algo¬ 
rithm for ClosureTreshold parametrized by fc since the Hitting Set problem 
parametrized by the size of the hitting set is W[2]-complete [17]. It means that 
if we want to significantly improve the delay of our enumeration algorithm for 
the clone S^q, we should drop the backtrack search since it relies on solving 

ClOSUREo). . 

■^10 

3.5 Limits of the infinite parts 

Here we deal with the two cases left which are the limits of the two infinite 
hierarchies of clones we have seen in the previous subsection. Let begin with 
-S '12 =< X A {y ^ z) >. 

Remark 1. Let 5 be a vector set and assume that there exists a. i £ [n] such that 
for all V £ S, Vi = 1 (resp. Vi = 0) then for all w £ Clg^yi^) have Wi = 1 
(resp. Wi =0). Then we will assume in this section that for all i £ [n] there is 
at least a vector u in 5 with Vi = 1 and a vector w with Wi = 0 . 

Theorem 4. Let S he a vector set, a vector v belongs to ClsiyiS) if and only if 

— there exists w £ S such that l(w) C l[w) 

— for all {k,i) £ l(u)x0(u) there exists w £ S withwk^i = (0,1) orwk^i = (1,0) 
Proof. Let us start by proving the following claim. 

Claim: Let k,i £ [n]. Then there exists u £ Clsi 2 (.^) such that Uk,i = (1,0) if 
and only if there exists v £ S such that Vk,i = (1, 0) or Vk,i = (0,1). 

Assume first that there exists v £ S such that Vk,i = (0,1). Let x £ S 
such that Xfe = 1 and y £ S such that yi = 0. Without loss of generality, such 
vectors exist by the assumption of Remark]^ Then u := x A {v ^ y) has the 
sought property, i.e Uk,i = (1,0). Assume now that for all v £ S, Vk^i 7 ^ (1,0) 
and Vk,i (0,1). We show that this property is preserved by the application 
of a: A (y —)■ z). For all v £ S, Vk,i = (1,1) or Vk,i = (0,0). Since the function 
X A{y ^ z) acts coordinate-wise on the vectors, if we consider w = x A {y ^ z) 
with x,y, z £ S we must have Wi = Wk- Therefore Wk,i (1,0) and Wk,i 7 ^ (0,1) 


which implies by induction that there is no v with Vk,i = (0, 1 ) and v € Clsi 2 (S). 
We can now prove the theorem. 

(<J=) We can simulate wAv with wA{w^v). We will show that for all i G 0(u) 
either there exists a vector v* G 5 such that l(v) C l(u*) and u* = 0 or we can 
construct it. Notice that it is sufficient in order to prove that v G Clsi 2 i.S) since 
we have v = /\ v*. So let i G ©(u) and assume that for all rc G 5 such that 

IL(u) C 1 ( 1 x 1 ) we have Wi = 1. Let w be such a vector and let l(v) = {ji, J 2 , Jfc}- 
We will construct a sequence of vector {w^)i<k such that for alH < fc and for 
all r < 1, w\ =1 and ic,- = 0. Let be the vector with w] = 1 and w} = 0. 
By the claim, such a vector exists in Clsi2{^)- Now for all I < k, let us define 
w’’ := w A {u’’ -A w^~^) where u’’ is a vector such that = 0 and u[ = \ 
and there is such a vector in Clsi2i^) by the claim. Since by induction we have 
wl~^ = 0, and since u{ = 1, we have (it* —)■ = 0 and thus w\ = 0. Now 

since = 0 and wj, = 1 we have = 1. Finally, for all r < ?, we have wj^ 
and w''~^ = 1. Hence = 1. We obtain that l{v) C l(u;^) and wf = 0. 

{=^) Let V G Clsi2i^)- Notice that if v = x A {y ^ z), then l(u) C t{x). Thus, 
there exists w G S such that l(v) C l(ui). Now, by the claim, for all G [n] 
such that Vk,i = (1,0) there exists w G S such that Wk,i = (1,0) or Wk,i = (0,1) 
which conclude the proof. 

□ 


Corollary 3. Closures^^ is polynomial. 

Finally, we deal with the clone S'lo =< x A {y V z) >. The characterization 
of ClsioiS) we give is very similar to the one of Clsi 2 {^) and the proof works 
in the same way. 

Theorem 5. Let S be a vector set, a vector v belongs to Clsio{S) if and only if 

— there exists w G S such that l(v) C 1 ( 11 ;) 

— for all {k,i) G l(u) x 0(v) there exists w G S with = (1,0) 

Proof. (<J=) Assume first that v G Clsio{S). Notice that if u = x A (y V z), 
then l(v) C l(x). Thus by a simple induction, there exists w G S such that 
l('i’) C l{w). 

Now let (fc,z) G l(v) X ©(u). Let us show that if for all w G 5 Wk,i 7 ^ (1,0), 
then Uki 7 ^ (1,0) for all u G Clsioi^) and then v ^ Clsig{S). It is sufficient to 
show that this property is preserved by the operation xA(yVz). So let a,b and c 
be three boolean vectors such that Ok^i 7 ^ ( 1 , 0 ), bk^i ( 1 , 0 ), Ck^i 7 ^ ( 1 , 0 ) and let 
d = a A (6V c). Assume that di = 0. Then either = 0 or both bi and Ci are 0. If 
Qi = 0 then Ofc = 0 since akg 7 ^ (1, 0) and then dk = akA{bk\/Ck = 0) = 0. Now if 
bi = 0 and Ci = 0 , we have bk = 0 and Cfe = 0 and then dk = akA{bk\/Ck = 0 ) = 0 . 
We conclude that dk,i 7 ^ (0,1) 

{=^) Assume that there exists u G S such that l(v) C l(it) and for all {k,i) G 
l(u) X ©(u) there exists w G S with Wk,i = (1,0). Notice that A G S'lo since 



a A b = a A {b V b). Let t := /\ u. We have l(i') C l{t). Either 

u^S, l(i?)C]L(u) 

V = t or there is i G [n] for which z), = 0 and = 1. For each such coordinate 
i, we will show how to construct a vector t' such that t' = 0 and such that 
IL(u) C l(f') C l{t). Let X := V u. Notice that Xi = 0 and since for all 

u£S, Ui—0 

j S IL(u) there exists w G S such that Wij = (0,1) we have l(u) C l(a;). Now let 
us define t' := t A a::. It is easy to see that t' satisfies the conditions sought. To 
construct t' we proceed as follow. Let {z/^, ...,z/^} := {zi € 5 | zz^ = 0}. Then let 
us construct the following sequence of vectors := tA(z/^Vz/^), := tA{t^Vy^), 

:=tA It is easy to see that t' = , and then t' G CIsiq{S). 

We conclude that v G (S). Indeed starting from t, we can apply the previous 
procedure to set to 0 each index i for which z;i = 0 and ti = 1. □ 

Corollary 4. Closures^j, is polynomial. 


4 Larger Domains 

In this section, we try to extend some results of the boolean domain to larger 
domains. 


4.1 Tractable closure 

We exhibit two families of clones J- such that Closure jr g P. As a result, we 
obtain a polynomial delay algorithm for EnumClosurejt using the backtrack 
search. 

The first tractable case is an extension of the clones of Subsection l3.4l Indeed 
using Th.[^ we can get an equivalent to Corollary. in any domain size. 

Corollary 5. If contains a near unanimity operation, then Closure_f g P. 

In particular, by using the same method as in Prop. [T^ we get the following 
result. 

Proposition 13. If P contains a near unanimity term of arity k, then EnumClosure_f 
can be solved in delay 

We could hope to increase the class of polynomial time decidable problems, 
by using other ideas from CSP. For instance, we may try to prove that if a 
clone P contains a Maltsev operation (it generalizes the majority operation) 
then Closure jr g P. 

The second tractable case is a generalization of Subsection |3.2[ We con¬ 
sider P the clone generated by the addition of two elements over D. To decide 
Closure jF, we have to solve a linear system, which can also be done in polyno¬ 
mial time over any domain. In fact we can further extend this result as shown 
in the next proposition. 


Proposition 14. Let f be a commutative group operation overD, Closure</> € 

P. 

Proof. We want to solve Closure</>, given S a set of vectors and v a vector. 

Let A be the matrix which has the elements of S as rows. The vector v is in 
Closure</> (5) if and only there is a vector x with coefficients in Z such that 
Ax = V. This equation is not over a field so we cannot solve it directly. We apply 
a classical group theorem to the finite commutative group {D,f), which states 
that D is a direct sum of cyclic groups Di,..., Dt whose order is the power of 
a prime. The equation Ax = v can be seen as a set of equations over fields: 

AiXi = Vi, for i < t, where Ai, Xi and Vi are the projection of A, x and v over 
Di. We can easily reconstruct an x which have the projections Xi on Di by the 
Chinese remainder theorem. Therefore, deciding whether v G Closure<^> (5) 
is equivalent to solving a set of linear systems and hence is in polynomial time. 

□ 

One natural generalization would be to allow the function / to be non com¬ 
mutative. In that case, we conjecture that CLOSURE<;y> is NP-hard. 


4.2 A limit to the backtrack search 

The last case we would like to extend is the clone generated by the conjunction. 
A natural generalization is to fix an order on D and to study the complexity of 
Closure</> with / monotone. Let / be the function over D = {0,1, 2} defined 
hy f{x,y) = x + yiix + y < 2 and f{x, y) = 2 otherwise. This function is clearly 
monotone for the usual order. 

Proposition 15. Closure</> is -complete. 

Proof. We reduce EXACT-3-COVER to Closure<j>. Let S be an instance of 
EXACT-3-COVER, that is a set of subsets of [n] of size 3. Clearly, S can be 
seen as an instance of Closure</> and we prove that 1 G Closure</> (5) 
if and only if there is an exact cover of S. First remark that / associative, 
therefore any element of Closure</>(5) can be written f{vi,f{v 2 ,f{v 3 ,...) 
with Vi G S. It is also commutative therefore we can associate a unique element 
of Closure</>(5) to a multiset of elements of S by the previous construction. 
Remark that it is never useful to have three times the same element in the 
multiset since f{v,v) = f{v,f{v,v)). If Vi > 0 then f{vi,Vi) = 2, therefore the 
vector 1 can only be generated by a set and not a multiset. Moreover a set which 
generates 1 satisfies that for alH < n there is one and only one of its elements 
with a coefficient 1 at the index i. Such a set is an exact cover of S, which proves 
the reduction. 

The problem is in NP because an element v is in Closure</>(5), if and 
only if there is a multiset of elements of S such that applying / to its elements 
yields v. This witness is of polynomial size since each element is at most twice 
in the multiset. □ 



This hardness result implies that we cannot use the backtrack search to solve 
the associated enumeration algorithm. However, if we allow a space proportional 
to the number of solutions, we can still get a polynomial delay algorithm for as¬ 
sociative functions, a property satisfied by the function / of the last proposition. 
Remark that the space used can be exponential while the backtrack search only 
requires a polynomial space. 

Proposition 16. If f is an associative function, then EnumClosure</> e 
DelayP. 

Proof. Let S be an instance of ENUMCLOSURE<y>. Let G be the directed graph 
with vertices C'l</>(5) and from each v € there is an arc to f{v,s) 

for all s G S. Since / is associative, by definition of G, every vertex of C'Z</>(5) 
is accessible from a vertex in S. Therefore we can do a depth-first traversal of 
the graph G to enumerate all solutions. A step of the traversal is in polynomial 
time: from an element v we generate its neighborhood: f{v, s) for s € S. The 
computation of f{v, s) is in time 0{n) and |5| = m. We must also test whether 
the solution f{v, s) has already been generated. This can be done in time 0{n) 
by maintaining a self balanced search tree containing the generated solutions, 
since there are at most |I?|" solutions. In conclusion the delay of the enumeration 
algorithm is in 0{mn) thus polynomial. □ 

To obtain a polynomial space algorithm, we could try to use the reverse 
search method. To do that, we want the graph G to be a directed acyclic graph, 
which is the case if we require the function to be monotone. The monotonicity 
also ensures that the depth of G is at most n{\D\ — 1). However we also need 
to be able to compute for each element of G a canonical ancestor in polynomial 
time and it does not seem to be easy even when / is monotone. We leave the 
question of finding a good property of / which ensures the existence of an easy 
to compute ancestor open for future research. 

5 Further work 

— Classify the complexity of Closure jr for domains larger than two. 

— Find T such that EnumClosurEjt can be solved with a polynomial delay 
and space but such that ClosureJ^ is NP-hard. 

— For set family (boolean domain) can we enumerate only the minimal ele¬ 
ments (backtrack fails even with the symmetric difference)? It would give an 
enumeration algorithm of the circuits of binary matroids. 

— What if we only allow operations between elements satisfying a given prop¬ 
erty (if the intersection is not empty for instance). 

— Can we allow the closure function to have a different action on different 
coefficients and still obtain the same kind of results ? 
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